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ABSTRACT 

The success of future large scale weak lensing surveys will critically depend on the 
accurate estimation of photometric redshifts of very large samples of galaxies. This in 
turn depends on both the quality of the pho tometric data and the photo-z estimators. 
In a previous study, (jBordoloi et al.l |20 ldT) we focussed primarily on the impact of 
photometric quality on photo-z estimates and on the development of novel techniques 
to construct the N(z) of tomographic bins at the high level of precision required for 
precision cosmology, as well as the correction of issues such as imprecise corrections 
for Galactic reddening. We used the same set of templates to generate the simulated 
photometry as were then used in the photo-z code, thereby removing any effects of 
"template error" . In this work we now include the effects of "template error" by gen- 
erating simulated photometric data set from actual COSMOS photometry. We use the 
trick of simulating redder photometry of galaxies at higher redshifts by using a bluer 
set of passbands on low z galaxies with known redshifts. We find that "template error" 
is a rather small factor in photo- z performance, at the photometric precision and filter 
complement expected for all-sky surveys. With only a small sub-set of training galaxies 
with spectroscopic redshifts, it is in principle possible to construct tomographic red- 
shift bins whose mean redshift is known, from photo-z alone, to the required accuracy 
of |A <2> < 0.002(1 + z). 

Key words: galaxies: distances and redshifts- cosmology: observations- methods: 
Statistical 



1 INTRODUCTION 

In the coming years, the next generation of all sky surveys 
plan to map the whole extragalactic sky in visible & near 
infra-red photometric bands. In order to achieve the perfor- 
mance required for precision cosmology, such surveys will 
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have a depth of Iab ^ 24.5, and contain multi-band photo- 
metric measurements of the order of ~ 10 9 galaxies over a 2n 
sr survey area. The use of these surveys for precision cosmol- 
ogy and for many other types of science will require the use 
of photometrically-estimated redshifts (hereafter photo-z). 
The use of photo-z for precisi on cosmology, e.g. in weak- 
lensing tomography ( |Hu| [l999|) , will require an impressive 
performance from the photo-z, not only in statistical accu- 
racy per object, but especially in the control of systematic 
biases in the mean redshifts of sets of galaxies used for cos- 
mological analysis. It is easy to derive that a precision of 1% 
in the dark energy equation of state parameter (w), requires 
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knowledge of the redshifts of the probes with a systematic 
accuracy of 0.002(1 + z). 

In a previous paper (| Bordoloi et alj |2010T) (hereafter 
BLA-10), we simulated the photometric redshift perfor- 
mance for a number of different photometric depths. These 
were defined as a combina tion of a Euclid-like infrared sur- 
vey (jRefreeier et alJl2010l ). complemented with additional 
ground-based photometry in the visible waveband. Artificial 
photometric catalogues were ge nerated from the moc k COS - 
MOS light-cone catalogues of iKitzbichler fc White] (120071 ^ 
and photo-2 of gala xies were produced us ing a template fit- 
ting code ZEBRA l|Feldmann et al.ll2006l ). We deliberately 
used exactly the same set of templates to construct the sim- 
ulated photometry as were then used in ZEBRA to estimate 
the photo-2. This choice eliminated the effects of "template- 
error" on the photo-2, by construction, allowing us to focus 
on other issues. We argued in BLA-10 t hat the exceptiona l 
performance of photo-2 in COSMOS fsee lllbert et al.l l2009). 
where deep photometry is available in a large number of fil- 
ters, suggested that template error was unlikely to be a dom- 
inant factor with the poorer and more limited photometric 
data that can realistically be anticipated over the whole sky. 
However, that analysis nevertheless represents an idealized 
situation, and a certain "circularity" of the argument. In 
this paper we aim to rectify this shortcoming by basing the 
simulated photometry on actual photometric data. 

Clearly, the ideal approach would be to take already 
available photometry that was comparable to that which will 
be available for the all-sky surveys, estimate the photo-2, 
and compare the performance with a complete set of reliable 
spectroscopically-determined redshifts. However, at present, 
there is no photometric survey that gives near-infrared pho- 
tomet ry at the depth envisa ged for e.g. Euclid (Y, J, H = 24 
at 5cr (Re fregier et alJ|201Cf )L nor is there a sufficiently com- 
plete spectroscopic sample that extends down to Iab St 24.5, 
where the redshift distribution extends well into the "red- 
shift desert" at z > 1.2. 

We can however perform the following trick. To first or- 
der, the photo-2 performance that we are interested in, i.e. 
on galaxies at some redshift using some given set of filters 
at some photometric depth, can be simulated using photom- 
etry on a set of brighter galaxies at slightly lower redshifts 
using a bluer set of filters. These photo-2 can also then be 
tested on a brighter spectroscopic sample at more benign 
(lower) redshifts. We argue below that degraded uBVrizJ 
photometric data from COSMOS, plus high quality redshifts 
from the zCOSMOS-bright redshift survey (| Lilly et al.ll2007l . 
|2009| ) at V «S 22.5 and 0.2 < z < 1.0 are well-matched (in- 
cluding in intrinsic luminosity) to a survey at 0.7 < z < 1.8 
with Iab SS 24.5, i.e. with a change in (1 + z) of a factor 
~ 1.4. Of course, the process is not perfect: it might be that 
templates (even at similar luminosities) become pathologi- 
cal between 0.2 < z < 1.0 and 0.7 < z < 1.8 but there is in 
our view not much evidence to support this. Perhaps more 
seriously, the number density of galaxies will be artificially 
low, by a factor of about four, so the problem of overlap- 
ping objects that we discussed in BLA-10 will not be fully 
accounted for. 

The paper is organized as follows: in Section 2 we de- 
scribe the photometric and spectroscopic catalogues that are 
used and the construction of the simulated photometric cat- 
alogue. Then we proceed to describe in Section 3 how the 



III 

: 


i i i i i i i i i 







1 1 1 1 1 1 1 1 1 1 


1 


- , 


fi 


i 2 


Y 

/I 

( 


\ 


J 


H 


1 


- 


I 




ill 




J 




J 




i 



0.4 0.6 0.8 1 .1.2 1.4 1.6 1.8 



Figure 1. Actual COSMOS filter response curves, after transfor- 
mation by a factor of 1.38 in wavelength (as described in Section 
2.3), compared with the nominal top- hat filters used in BLA-10 
to simulate an all-sky survey combining ground- and space-based 
data. There is a reasonable match in spacing and width. Since the 
current photo-2 analysis used these transformed filter response 
curves, the differences should not be important. 

photo-2 are estimated and finally in Section 4 how they per- 
form. Much of the performance analysis follow closely the 
approach of BLA-10 to which the reader is referred for more 
details. 



2 CREATING THE MOCK CATALOGUE 

Our goal is to simulate a realistic photometric survey con- 
sisting of visible photometry of the depth and filter coverage 
of planned surveys. In BLA-10, we defined three nominal 
surveys that were defined by increasing photometric depth 
in the visible, combined with NIR photometry of the depth 
expected from the proposed Euclid survey. The second of 
these, Survey B, which we argued had just sufficient per- 
formance, was intended to simulate the depth of the Pan- 
STARRS-23 or DEqJ surveys. For precise comparison with 
the results of BLA-10, we use this same intermediate depth 
Survey B for the present analysis. 



2.1 Input photometric data 

We use the COSMOS photometric catalogue which contains 
fluxes measured in 30 bands from Sub aru, HST, CHFT , 
UKIRT, Spitzer and G ALEX telescopes (Hibert et alj|2009h . 
We refer the reader to ICapak et al.l ([2008) for a complete 
description of observations, data reductions and the pho- 
tometry catalogue. For our present purpose we use only the 
uBVrizJ broad-band filter data from this catalogue as de- 
scribed in detail in section 2.3 . 



1 http:/ /pan-starrs. ifa.hawaii.edu 

2 http:/ /www. darkenergysurvey.org 
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2.2 Input spectroscopic data 

We use the spectrosc opic data from the zCOSMOS sur- 
vey l|Lillv et al.ll2007| y We use the so-called "20k-bright" 
sample of 20,000 galaxies selected at Iab ^ 22.5 (Lilly et 
al., in preparation). These galaxies were observed with the 
VIMOS/VLT spectrograph covering a wavelength range of 
5500 A < A < 9000 A at a resolution of 600. The average 
accuracy of individual redshifts is 110 km s , irrespective 
of redshift. 

Unfortunately, the s pectroscopic sam ple is not 100% 
complete. As described in lLillv et al. 2009, each redshift in 
zCOSMOS is assigned to a confidence class that is based 
on both a subjective assessment of the reliability of the 
spectroscopic measurement and whether there is a broad 
consistency between the spectroscopic and photometric red- 
shifts, i.e. whether |Az| < 0.08(1 + z). The photo- z come 
from from the full COSMOS photometric data set, as de- 
scribed in lllbert et al.l |2009i The actual reliability of the 
redshifts across the confidence class scheme is estimated us- 
ing the repeatability of the redshifts from duplicated spec- 
troscopic observations. In the current work we only make 
use of the redshift measurements having confidence classes 
( 4.x, 3.x, 9.5 + 9.4 + 9.3 + 2.5 + 2.4 + 1.5) which are 
dem onstrated from th e repeat observations to be 99 % reli- 
able (|Lillv et al.ll2009| ). The decimal places in the confidence 
class say whether there is broad agreement between the spec- 
troscopic and photometric redshifts, e.g. a decimal place of 
.5 signifies agreement between the spectroscopic and pho- 
tometric redshift estimates for that object, otherwise they 
do not agree. The confidence classes marked 4.x and 3.x are 
objects with very secure spectroscopic redshifts irrespective 
of their photometric redshifts. 

As detailed below, we restrict the redshift range of the 
sample to 0.2 < z < 1.0 and select galaxies to have V ^ 22.5. 
We exclude stars and broad line AGN from their spectro- 
scopic flags (as they are point sources, they can be photo- 
metrically identified and will not be used in a weak lensing 
analysis). Reliable redshifts (as defined above) are available 
for 83.5% of the remaining galaxies. Of these, 80% have very 
secure redshifts (99.5% reliable) based on the spectra alone. 
The remainder have less secure spectroscopic redshifts that 
are trusted only because they are consistent with the photo- 
z. Clearly, any of these galaxies which have a pathological 
spectral energy distribution yielding a wrong photo-z will 
have been excluded from our sample. However, any such 
pathological objects for which we secured a reliable redshift 
from the spectrum alone would be included and would be 
a "catastrophic failure" in the photo-z. It should be noted 
that the most secure spectro-z have a catastrophic failure 
rate in photo-z of about 1%. 

It should be recognized that this procedure, which we 
cannot avoid, means that the current analysis may underes- 
timate the number of "catastrophic failures" in the photo-z, 
and thus the photo-z quality, in the overall sample if the 
incidence of failure, or the photo-z error, are in some way 
correlated with t he difficulty of measurin g a purely spec- 
troscopic r e dshift ijflildebrandt et aTll2008r ). As discussed in 
iLillv et al l (|2007l ) there is in fact a very good correspon- 
dence between the spectroscopic repeatability of the red- 
shifts and the fraction of objects with consistent photo-z, 
indicating that the vast majority of the objects with less 
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Figure 2. The redshift distributions of the samples under study. 
The solid line gives the normalized distribution of the original set 
of zCOSMOS galaxies within 0.2 ^ * ^ 1.0 and V ^ 22.5, as 
described in the text. The shaded histogram gives the normal- 
ized distribution of the same set of galaxies after transforming to 
higher redshift using equation 1, spanning 0.65 < z < 1.8. For 
comparison, the dashed line gives the normalized predi cted N(z) 
for Iab 24.5 adopted from Amara & Rcfrcgicii l2007l 



secure spectroscopic redshifts that are inconsistent with the 
photo-z, are in fact wrong spectroscopic identifications. If 
the fraction of real "catastrophic failures" in the photo-z is 
not correlated with the success in measuring a spectroscopic 
redshift, then the number of such failures would be under- 
estimated in the current sample by 20%, i.e. of order 0.2% 
in the overall sample, which we believe is unimportant for 
the current purposes. 

This point does however emphasize the need to have 
large spectroscopic samples in the future with very high 
completeness in securing highly reliable redshifts if photo-z 
schemes are to be adequately tested and characterized at 
the levels of precision required. With this caveat in mind, 
the present purposes are nevertheless served by the avail- 
able spectroscopic sample. 



2.3 Generating the mock photometric catalogues 

As outlined above, we assume that bluer photometry of 
lower redshift galaxies (at 0.2 < z < 1.0) can be used as 
a substitute for redder photometry of higher redshift galax- 
ies. Specifically, we take actual broad band uBVrizJ pho- 
tometry from COSMOS to mimic the grizYJH bands for 
the simulated data. As shown in Figure-[T] this broadly cor- 
responds to a multiplicative transformation in wavelength 
given by A' = <5A, with 8 ~ 1.38. The match is naturally 
not perfect, but this should not matter since we will use 
the transformed COSMOS filter passbands in the photo-z 
code and it is unlikely that the precise location of photo- 
metric bands affects the photo-z performance. In Figure-[T] 
the vertical shaded regions give the wavelength ranges of 
the filters of Survey B, while the curves represent the broad 
band COSMOS filters after the above wavelength transfor- 
mation. Given that the match is not perfect, the photo-z 
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performance obtained in this paper should therefore not be 
regarded as exact "predictions" , but rather are intended to 
enable us to examine the general issue of "template error" 
in template fitting photo-z codes. 

This wavelength transformation is equivalent to a red- 
shift transformation of galaxies as follows: 



z =<5(l + z)-l 



(1) 



We first isolate all the galaxies in zCOSMOS-bright 
with secure redshifts 0.2 < z spec < 1.0 which do n ot lie 
in photometrically masked areas of lllbert et all (|2009h and 
which have complete photometric information in the se- 
lected bands. Since essentially all Iab ^ 22.5 galaxies have 
(V — I)ab > we can assume that this sample will be com- 
plete down to V ^ 22.5 and we retain only these galaxies. 
This gives us a sample of approximately 8500 galaxies down 
to V sC 22.5. 

To convert the available COSMOS photometry to sim- 
ulate Survey B we rename the filters uBVrizJ to grizYJH 
and add 2.0 magnitudes to all the photometric measure- 
ments of each galaxy. This creates a fainter photometric cat- 
alogue that should be complete in the "transformed I-band" 
to Itr,AB ^ 24.5. The filter response curves are carried over 
by shifting the wavelengths by a factor 1.38. 

Finally we apply the above redshift transformation to 
all the spectroscopic redshifts. This produces a new redshift 
distribution between 0.65 < z < 1.8 that is well matched to 
that expected, at z > 0.65, for an Iab ^ 24.5 sample in the 
sky. This is shown in Figure-[2]where the dashed line gives the 
normalized predicted redshift distribution for an / ^ 24.5 
sample, with a median re dshift z mc dian = 0.9, adopted from 
lAmara fc Refregierl 120071 . compared with the redshift dis- 
tribution of the original V ^ 22.5 zCOSMOS sample and 
the "transformed" Iab ^ 24.5 sample. The redshift range 
at z > 0.8 is where the performance of photo-z generally 
is less proven. Furthermore, it should be noted that the in- 
creased distance modulus (2.2 between z = 0.2 and z = 0.7 
and 1.6 between z = 1 and z — 1.8, allowing for a band- 
width stretching term) is well-matched to the 2.0 magni- 
tude increase in depth between V = 22.5 and Iab = 24.5. 
This means that the transformed simulated galaxies are well- 
matched in absolute magnitude to the lower redshift input 
sample. 

The catalogue thus created therefore contains actual 
photometric data from the COSMOS survey. As a result, 
It effectively eliminates the circularity problem of using the 
same SEDs to create a mock photometric catalogue and to 
estimate the photo-z of the galaxies. In view of the +2 mag 
transformation of the magnitudes, the noise in the COSMOS 
flux densities is artificially low. We therefore add additional 
Gaussian noise to the transformed flux densities in each 
pass-band. We use the adopted survey parameters as given 
in Table 1 of BLA-10 to define the required noise on each 
galaxy in each band, and add random noise in quadrature 
to the (known) uncertainties in the transformed COSMOS 
photometry so as to match this required final uncertainty. 

Although it eliminates the obvious circularity of us- 
ing templates to construct the catalogue, this procedure is 
clearly not perfect. It first assumes that any problems associ- 
ated with the templates are no more severe at 1.0 < z < 1.8 
than they are at 0.2 < z < 1.0. As noted above, the lu- 
minosities of the galaxies are rather similar at the two red- 



0.15- 

* 0.05; 

o : 

0.05 



f °; 

S -0.05 - 
-0.1 




-H- 



-H- 




0.6 



0.8 



1.2 



1.4 



1.6 



phot 



Figure 3. The overall photo-z performance for Survey B depth, 
based on a single "maximum likelihood" photo- z for each galaxy. 
The blue line gives the initial performance without any cleaning, 
the green line after "a priori" cleaning, which eliminates almost 
20% of galaxies, and the red line gives the performance after 
cleaning and after generating the probability distr ibution func- 
tions from the likelihoods as in lBordoloi et al.l 1120101). This F igure 
should be compared with Figure-2 in lBordoloi et al] {2010). 



shifts since the dimming corresponds quite closely to the 
change in distance modulus. Changes in metallicity and in 
the overall specific star-f ormation rates are certainly present 
between these red shifts <|Lillv et al.lll996l : lElbaz et af1l2007l: 
iDaddi et all l2007h but are not dramatic. iMannucci et al.l 
<|2010h have shown that galaxies up to z ~ 2.5 follow a funda- 
mental metallicity relation (FMR) with a residual dispersion 
of ~ 0.05 dex in metallicity( 12 %) and with no indication of 
evolution with redshift. Other more practical shortcomings 
are that problems associated with over-lapping objects (see 
BLA-10 for a discussion) will be less severe, since the parent 
photometric catalogue is effectively based on brighter galax- 
ies, with a roughly fourfold lower surface density on the sky. 
Furthermore, the dominant uncertainties in the photometry 
come from our added "artificial" Gaussian term. 



3 ESTIMATING THE PHOTO-Z 

The estimation of photo-z is done in the same way as was 
done in BLA-10, using the te mplate-fitting photo-z code 
ZEBRA (jFeldmann et al.ll2006h to calculate photo-z for the 
galaxies in the final simulated catalogue. ZEBRA gives both 
a single best fit "maximum likelihood" redshift and template 
type, together with confidence limits based on the \ 2 sur- 
faces, and normalized likelihood functions L(z) for the indi- 
vidual galaxies. The filter response curves used in ZEBRA 
for this analysis are generated from the actual COSMOS 
filter curves, shifted in wavelength by a factor of 1.38 as 
described above. 

We stress that the templates used in this work 
are purely synthe t ic ste llar populations generated from 
iBruzual fc Charlotl (|2003l ) evolutionary models and the pa- 
rameters varied to compute the whole set of templates are 
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Figure 4. Examples of reconstructed N(z) for maximum like- 
lihood redshifts (blue) after "a priori" cleaning of outliers, sum 
of the uncorrected likelihood functions (green) and sum of pdfs 
obtained by applying a statistical correction (red). The filled his- 
togram represents the true N(z) of the tomographic bin. The 
vertical lines show the mean redshift for each reconstruction. The 
sum of pdfs (red) represent the best reconstruction of the true 
N(z). 

age, star formation history (continuous underlying SFR su- 
perimposed with single or multiple bursts), dust obscura- 
tion in the galaxy and stellar metallicity. These templates 
include a range of internal reddening, which ranges from 
< A v < 2 magnitudes. Intergalactic absorption in each 
temp late is compensated for using the Madau law l|Madaul 
1 19951 ). A final set of 10,000 templates are produced which 
are used to estimate photo-z for the simulated survey. The 
standard sm all adjustment of photometric zero-points was 
undertaken ijllbert et al.l [2009). but we emphasize that the 
templates were not optimized using the COSMOS photome- 
try. 



4 PHOTO-Z PERFORMANCE 

In this section we assess the performance of the photometric 
redshift estimator as applied to our more realistic simulated 
photometry and quantify the results in comparison with the 
results of BLA-10. We follow closely the procedures of that 
first paper and refer the reader to it for further technical 
details. 

Galaxies are first binned in redshift on the basis of their 
observed maximum-likelihood p hoto-z. The bia s A z (z ) and 
dispersion cr z (z), as defined in iBordoloi et al.l (|2010l ). are 
computed as a function of the observed photo-z. As in BLA- 
10, the samples are cleaned by applying an "a priori" clean- 
ing algorithm that is based solely on the shape of the output 
L(z) function. The redshift probability distribution function 
(pdf) of each galaxy, „Sf (z), is constructed from its individual 
likelihood function L(z) using the procedure described by 
BLA-10. This is based on the principle that the real redshifts 
(for those galaxies with known redshifts) should, within a 
large enough sample, be uniformly distributed throughout 



their respective pdfs. Some set of galaxies with known (spec- 
troscopic) redshifts is therefore used to construct a mapping 
between L(z) and J?{z) which is assumed to be the same, 
in cumulative probability space, for all galaxies. The reader 
is referred to BLA-10 for details of this algorithm, and the 
mapping of L(z) to Jf(z) is given by the equation 12 and 
shown in Figure 3 of BLA-10. In the current simulation, 
this transformation is constructed using a random subset of 
~10% of the total sample, (i.e. of the order of 10 galaxies), 
the remaining 90% then serving as a test. 

In weak lensing tomography using photo-z, there are 
two principal requirements. First, the photo-z of individ- 
ual galaxies must be known to a precision of about a z < 
0.05(1 + z) in order that tomographic bins can be defined 
without significant overlap, thereby reducing the effects of 
"intrinsic alignments". Second, and more demanding, the 
N(z) of each of these bins must be known accurately for the 
correct quantitative interpretation of the lensing signal. It 
is found that the mean of the N(z ) is the most relevant pa- 
rameter l|Amara fc Refregierll2007l ). Hence we quantify our 
results in terms of this quantity. As noted above, to attain 
a precision of order 1% on the dark energy equation of state 
parameter, w, the mean of the N(z) for each redshift bin 
must be known to an accuracy of 0.002(1 + z), and we take 
this as our nominal requirement. 

As in BLA-10, we first look at the use of a single 
"maximum-likelihood" photo-z for each galaxy. In Figure- 
[3] we plot the r.m.s. dispersion o z (z) and the mean bias 
Az(z) derived from individual galaxies by comparing the 
maximum likelihood redshifts with the actual redshifts. The 
blue line gives the initial performance of the photo-z code, 
without any removal of outliers. The green line gives the per- 
formance after the "a priori" identification of outliers (rec- 
ognized purely from their likelihood curves, see BLA-10 for 
details) and the red line gives the performance after modi- 
fying the likelihood functions to produce the final pdfs, and 
using the peaks of the pdf. It can be seen that the photo-z 
performance improves after each of these steps, although it 
should be noted that almost 20% of galaxies are lost in the 
cleaning process - most of which would have had a quite 
usable photo-z. As in BLA-10, the required er z (z) can be 
achieved relatively easily. 

Turning to the more demanding problem of the error on 
the N(z), the use of maximum likelihood redshifts will not, 
by definition, include the wings of the N(z) or deal with 
any catastrophic failures in the photo-z. Figure-|4] shows a 
reconstructed tomographic bin for different methods. The 
true N(z) (shaded histogram) is best represented by the 
summing of pdfs obtained by applying the L(z) to Jif(z) 
transformation (red line). The mean of each reconstruction 
is shown by the vertical lines and for the above method the 
we obtain the minimum bias in the reconstruction. 

In Figure-[5] the blue diamonds show the bias in the 
mean of the N(z) using just the maximum likelihood red- 
shifts after "a priori" cleaning. The error bars are simply 
calculated as the expected uncertainty in the average red- 
shift based on Poisson statistics, knowing a z (z), i.e. we could 
not detect systematic biases below this level. The biases us- 
ing these maximum likelihood redshifts fall well short of the 
requirement of 0.002(1 + z) (shown by the shaded region) 
and are much larger than the Poisson uncertainties in our 
simulation. 
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Figure 5. The error in the mean of the N(z) for tomographic redshift bins for the simulated Survey B. The blue diamond symbols 
are obtained using only the maximum likelihood redshifts, the open squares are obtained from the sum of the uncorrected likelihood 
functions, and the filled circles by summing the pdfs obtained by applying a statistical correction. All three cases are after the "a priori" 
removal of about 20% of galaxies. The shaded region gives the nominal requirement of A< z > ^ 0.002(1 + z). 



As in BLA-10, some improvement is obtained by sum- 
ming the individual L(z) curves to define the N(z), shown 
as the red squares, but the required performance is only re- 
alized when we sum the individual pdfs that are obtained 
for each galaxy using the procedure outlined above, and de- 
scribed in more detail in BLA-10. These are shown as black 
filled circles. In this case, the error bars come from taking 
different subsets of the spectroscopic sample to define the 
L(z) to J£ (z) transformation. With this procedure, an un- 
certainty of 0.002(1 + z) is met for each of the bins. 

Comparing Figure-0 with Figure-2 in BLA-10, it can 
be seen that there is a degradation of performance in this 
more realistic simulation compared with that of our earlier 
paper. This is clearly the signature of "template error" and 
of the other realistic observational problems that are present 
in the current study through the use of actual photometric 
data. However, the degradation is relatively small: in order 
to achieve the same performance, about 7% more galaxies 
are lost due to "a priori" cleaning as compared to BLA- 
10. This modest degradation justifies our earlier assumption 
that template error was unlikely to be a dominant factor, at 
least with the typical S/N and number of filters expected 
for foreseeable all-sky photometric surveys. 

As described above, the required performance is met 
with of order 1000 objects to map this transformation. This 
number makes sense: In BLA-10 we pointed out the po- 
tential reduction in the number of spectroscopic redshifts 
needed if these were used to characterize the photo-z scheme 
rather than to estimate the N(z) directly. Our use of spec- 
troscopic redshifts to construct the L(z) to Ji?(z) mapping 



transformation is effectively an empirical determination of 
the errors on the redshifts. It is not surprising that, to go 
from a a z ~ 0.05 to a bias that is 30 times smaller, requires of 
order 30 2 known redshifts, c.f. equation (2) in BLA-10. This 
suggests that the number of spectroscopic redshifts would 
not increase dramatically as the total sample of photo-z ob- 
jects goes up. 

On the other hand, the discussion in Section 2.2 high- 
lights the dangers of using a spectroscopic sample that is not 
demonstrably complete, because of lingering concerns that 
the "catastrophic failure" rate of the photo-z, or even the 
statistical error a z , may be correlated with the success of 
measuring a spectroscopic redshift, and therefore that the 
set of galaxies used to set up the scheme may not be repre- 
sentative. Establishing that the calibrating sample is in fact 
completely representative, i.e. demonstrating to a possibly 
skeptical audience that the photo-z performance has been 
met, will likely require a very much larger sample. 



5 SUMMARY 

This paper is the second to focus on the problem of photo- 
z performance for weak-lensing cos mology from anticipa ted 
all-sky surveys. In a previous study ()Bordoloi et al.lr2010l ) we 
used a simulated photometric catalogue that was generated 
using the same set of templates as were then used to estimate 
the photo-z, thereby removing any issues associated with the 
choice of templates from the analysis. 

To avoid this potential circularity, we have used in 
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the current study a more realistic photometric catalogue 
that has been generated by degrading actual photometry 
of galaxies in the COSMOS field for which secure spectro- 
scopic redshifts are available from zCOSMOS. In order to 
generate a simulated photometric catalogue that extends to 
the required depth in the near-infrared, and to overcome 
the paucity of spectroscopic redshifts at z > 1, we employ 
the trick of basing the catalogue on bluer photometry of 
somewhat brighter objects at lower redshifts, in particular 
simulating an Iab ^ 24.5 sample at 0.65 < z < 1.8 from a 
V < 22.5 sample at 0.2 < z < 1.0. Such galaxies are well 
matched in luminosity. 

The photo-z are estimated using synthetic templates 
with no attempt at optimization, so in this sense no spec- 
troscopic redshifts are formally used for calibration of the 
photo-z. About 10% of the sample (~ 10 3 redshifts) are used 
in setting up the algorithm that is used to convert the likeli- 
hood functions to probability distribution functions, which 
is essential for the accurate estimate of the mean redshifts. 
We suspect that it is the total number and not the percent- 
age which will be relevant if this approach is applied to much 
larger samples. 

We find that, although the performance of the new sim- 
ulation is slightly worse than the previous, more idealized 
simulation, the degradation is quite small. In effect, 7% more 
galaxies must be eliminated through an "a priori" cleaning 
in order to attain the required performance. This supports 
our earlier assertion that "template error" would be unlikely 
to dominate the photo-z performance of surveys with the 
modest S/N and limited numbers of filters that are likely to 
be available in all-sky surveys. 

The new analysis therefore supports our earlier con- 
clusion that photo-z performance at the level required for 
precision cosmology are in principle achievable with foresee- 
able photometric surveys. While it indicates that a satisfac- 
tory photo-z scheme can be set up, it highlights however 
that large spectroscopic samples of very high completeness 
and with very reliable spectroscopic redshift determinations 
may be required in order to actually demonstrate that the 
required performance has been achieved, because of linger- 
ing concerns that redshift incompleteness may be correlated 
with photo-z: accuracy. This may ultimately be the limiting 
factor. 
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